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Abstract. Automatic structures are finitely presented structures where the universe and all 
relations can be recognized by finite automata. It is known that the isomorphism problem for 
automatic structures is complete for S\ ; the first existential level of the analytical hierarchy. 
Several new results on isomorphism problems for automatic structures are shown in this 
paper: (i) The isomorphism problem for automatic equivalence relations is complete for 
IIi (first universal level of the arithmetical hierarchy), (ii) The isomorphism problem for 
. automatic trees of height n > 2 is J72„_3-complete. (iii) The isomorphism problem for 

automatic linear orders is not arithmetical. This solves some open questions of Khoussainov, 
Rubin, and Stephan. 
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1 Introduction 



The idea of an automatic structure goes back to Biichi and Elgot who used finite automata to 
qq ' decide, e.g., Presburger arithmetic [5]. Automaton decidable theories [5] and automatic groups [5] 

are similar concepts. A systematic study was initiated by Khoussainov and Nerode [13j who also 
coined the name 11 automatic structure" . In essence, a structure is automatic if the elements of the 
universe can be represented as strings from a regular language and every relation of the structure 
can be recognized by a finite state automaton with several heads that proceed synchronously. 
Automatic structures received increasing interest over the last years [11311 111411511612^] . One of 
the main motivations for investigating automatic structures is that their first-order theories can 
be decided uniformly (i.e., the input is an automatic presentation and a first-order sentence). 

Automatic structures form a subclass of recursive (or computable) structures. A structure is 
recursive, if its domain as well as all relations are recursive sets of finite words (or naturals). A well- 
studied problem for recursive structures is the isomorphism problem, where it is asked whether two 
given recursive structures over the same signature (encoded by Turing-machines for the domain 
and all relations) are isomorphic. It is well known that the isomorphism problem for recursive 
structures is complete for the first level of the analytical hierarchy S\. In fact, S\ -completeness 
holds for many subclasses of recursive structures, e.g., for linear orders, trees, undirected graphs, 
Boolean algebras, Abelian p-groups, see [417] . ^-completeness of the isomorphism problem for a 
class of recursive structures implies non-existence of a good classification (in the sense of [I]) for 
that class |J. 

In [14] . it was shown that also for automatic structures the isomorphism problem is S\- 
complete. By a direct interpretation, it follows that for the following classes the isomorphism 
problem is still -complete |18j : automatic successor trees, automatic undirected graphs, auto- 
matic commutative monoids, automatic partial orders, automatic lattices of height 4, and auto- 
matic 1-ary functions. On the other hand, the isomorphism problem is decidable for automatic 
ordinals [TS] and automatic Boolean algebras [Tl]. An intermediate class is the class of all locally- 
finite automatic graphs, for which the isomorphism problem is complete for II® (third level of the 
arithmetical hierarch\rf) [2T1. 



* The second and third author are supported by the DFG research project GELO. 
1 For background on the arithmetical hierarchy see, e.g., |19| . 



For many interesting classes of automatic structures, the exact status of the isomorphism 
problem is open. In the recent survey 22 it was asked for instance, whether the isomorphism 
problem is decidable for automatic equivalence relations and automatic linear orders. For the 
latter class, this question was already asked in [15j . In this paper, we answer these questions. Our 
main results are: 

— The isomorphism problem for automatic equivalence relations is Tl^-complete. 

— The isomorphism problem for automatic successor trees of finite height k > 2 (where the 
height of a tree is the maximal number of edges along a maximal path) is /I^^g-complcte. 

— The isomorphism problem for automatic linear orders is hard for every level of the arithmetical 
hierarchy. 

Most hardness proofs for automatic structures, in particular the ^-hardness proof for the iso- 
morphism problem of automatic structures from |14j . use transition graphs of Turing- machines 
(these graphs are easily seen to be automatic). This technique seems to fail for inherent reasons, 
when trying to prove our new results. The reason is most obvious for equivalence relations and 
linear orders. These structures are transitive but the transitive closure of the transition graph of a 
Turing-machine cannot be automatic in general (it's first-order theory is undecidable in general). 
Hence, we have to use a new strategy. Our proofs are based on the undecidability of Hilbert's 
10 th problem. Recall that Matiyasevich proved that every recursively enumerable set of natural 
numbers is Diophantine [17] . This fact was used by Honkala to show that it is undecidable whether 
the range of a rational power series is N [5] ■ Using a similar encoding, we show that the isomor- 
phism problem for automatic equivalence relations is TT^-complete. Next, we extend our technique 
in order to show that the isomorphism problem for automatic successor trees of height k > 2 is 
il^^g-complete. In some sense, our result for equivalence relations makes up the induction base 
k = 2. Finally, using a similar but technically more involved reduction, we can show that the 
isomorphism problem for automatic linear orders is hard for every level of the arithmetical hier- 
archy In fact, since our proof is uniform on the levels in the arithmetical hierarchy, it follows that 
the isomorphism problem for automatic linear orders is at least as hard as true arithmetic (the 
first-order theory of (N; +, x)). At the moment it remains open whether the isomorphism problem 
for automatic linear orders is 17-f-complete. 

2 Preliminaries 

Let N+ = {1, 2, 3, . . .}. Let p{x\, . . . , x n ) G N[xi, . . . , x n ] be a polynomial with non-negative integer 
coefficients. We define 

lmg + (p) = {p(yi,...,y n ) \yi,---,y n £ 

If p is not the zero-polynomial, then Img + (p) C N+. 

Details on the arithmetical hierarchy can be found for instance in [19] . With E^ we denote the 
n th (existential) level of the arithmetical hierarchy; it is the class of all subsets iCfj such that 
there exists a recursive predicate P C N™ +1 with 

A = {a G N | 3xi Vx2 ■ • • Qx n : (a, x\, . . . , x n ) G P}, 

where Q = 3 (Q — V) for n odd (even). The set of complements of Z^-sets is denoted by 77°. 
By fixing some effective encoding of strings by natural numbers, we can talk about i7°-sets and 
77"-sets of strings over an arbitrary alphabet. A typical example of a set, which does not belong 
to the arithmetical hierarchy is true arithmetic, i.e., the first-order theory of (N; +, x), which we 
denote by FOTh(N;+, x). 

We assume basic terminologies and notations in automata theory (see, for example, [10]). For 
a fixed alphabet E, a non- deterministic finite automaton is a tuple A = (S*, A, I, F) where S is 
the set of states, A C S x E x S is the transition relation, I C S is a set of initial states, and 
F C S is the set of accepting states. A run of A on a word u = a\a2 ■ ■ ■ a n (ax, a2 . . . , a n G E) is 



2 



a word over A of the form r = (go, ai, <7i)(?i) Q2) • • • (?n-i, in, Qn), where go G J. If moreover 
<?„ £ .F, then r is an accepting run of „4 on u. We will only apply these definitions in case n > 0, 
i.e., we will only speak of (accepting) runs on non-empty words. 

Given two automata A\ = (Si,Ai,Ii,F%) and A2 = (S%,A2,Ii,Fi) over the same alphabet 
E, we use Ai W A2 to denote the automaton obtained by taking the disjoint union of Ai and A2. 
Note that for any word u £ E + , the number of accepting runs of A\ W.A2 on u is equal to the sum 
of the numbers of accepting runs of A\ and A2 on u. We use A\ x A2 to denote the Cartesian 
product of A\ and A2- It is the automaton (Si x S2,A,Ii x /2,-Fi x F2), where 

A = {{{Pl,P2),(J, (91,52)) I (pi,cr,qi) E 4i, (p 2 ,o-,q 2 ) £ ^2}- 

Then, clearly, the number of accepting runs of Ai x ^2 on a word u € -^(^1) H L(A 2 ) is the 
product of the numbers of accepting runs of and ^2 on u. In particular, if .Ai is deterministic, 
then the number of accepting runs of A\ x A2 on u £ £(Ai) PI £(^2) is the same as the number 
of accepting runs of A2 on u. In the following, if A is a non-deterministic automaton and D is a 
regular language, we write D W A (resp. I? Pi -4) for the automaton Ad W -4 (resp. Ad x „4), where 
Ad is some deterministic automaton for the language D. 

We use synchronous n-tape automata to recognize n-ary relations. Such automata have n input 
tapes, each of which contains one of the input words. The n tapes are read in parallel until all input 
words are processed. Formally, let E = ELi{o} where o ^ E. For words wi,W2,...,w n £ E* , their 
convolution is a word Wi <g) • • • ® w n £ (E%)* with length max{|u>i|, . . . , w„|}, and the k th symbol 
of W\ (E> • • • ® w n is (ai, . . . , a n ) where ai is the k th symbol of Wi if k < |iUf|, and 17^=0 otherwise. 
An n-ary relation i? is recognizable if the set of all convolutions of tuples (wi, . . . , w n ) £ R is 
a regular language. 

A relational structure S consists of a domain D and atomic relations on the set D. We will 
only consider structures with countable domain. If Si and £2 are two structures over the same 
signature and with disjoint domains, then we write 1S1 W S2 for the union of the two structures. 
Hence, when writing Si W 1S2, we implicitly express that the domains of Si and £2 are disjoint. 
More generally, if {Si \ i £ 1} is a class of pairwise disjoint structures over the same signature, 
then we denote with l±l{<Si i £ 1} the union of these structures. A structure <S is called automatic 
over E if its domain is a regular subset of E* and each of its atomic relations is FA recognizable; 
any tuple P of automata that accept the domain and the relations of S is called an automatic 
presentation of S; in this case, we write S(P) for S. If an automatic structure S is isomorphic to 
a structure S' , then S is called an automatic copy of S' and S' is automatically presentable. In 
this paper we sometimes abuse the terminology referring to S' as simply automatic and calling an 
automatic presentation of S also automatic presentation of S' . We also simplify our statements by 
saying "given/compute an automatic structure S" for "given/compute an automatic presentation 
P of a structure 5(P)". The structures (N; <,+) and (Q; <) are both automatic structures. On 
the other hand, (N; x) and (Q; +) have no automatic copies (see [12|22j and [24]). 

Consider FO + 3°° + 3 n,rn , the first-order logic extended by the quantifiers 3°° (there exist 
infinitely many) and 3 n ' m x (there exist finitely many and the exact number is congruent n modulo 
m, where m, n £ N). The following theorem from [2 8 1 3121) lays out the main motivation for 
investigating automatic structures. 

Theorem 1. From an automatic presentation P and a formula f(x) £ FO + 3°° + 3™> m in the 
signature of S (P) , one can compute an automaton whose language consists of those tuples a from 
S(¥) that make ip true. In particular, the FO + 3°° + 3 n ' m theory of any automatic structure S is 
(uniformly) decidable. 

Let K, be a class of automatic structures closed under isomorphism. The isomorphism problem for K, 
is the set of pairs (Pi,P2) of automatic presentations with <S(Pi) = S(^-i) £ K.. The isomorphism 
problem for the class of all automatic structures is complete for El — the first level of the analytical 
hierarchy p~4j (this holds already for automatic successor trees). However, if one restricts to special 
subclasses of automatic structures, this complexity bound can be reduced. For example, for the 
class of automatic ordinals and also the class of automatic Boolean algebras, the isomorphism 
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problem is decidable. Another interesting result is that the isomorphism problem for locally finite 
automatic graphs is TTg -complete [21]. All these classes of automatic structures have the nice 
property that one can decide whether a given automatic presentation describes a structure from 
this class. Theorem [T] implies that this property also holds for the classes of equivalence relations, 
trees of height at most k, and linear orders, i.e., the classes considered in this paper. 

3 Automatic Equivalence Structures 

An equivalence structure is of the form £ = (D; E) where E is an equivalence relation on D. 
In this section, we prove that the isomorphism problem for automatic equivalence structures is 
77^-complete. This result can be also deduced from our result for automatic trees (Section @|. But 
the case of equivalence structures is a good starting point for introducing our techniques. 

Let £ be an automatic equivalence structure. Define the function he ■ N U {Ho} — >• N U {Ho} 
such that for all n E N U {Ho}, hs{ri) equals the number of equivalence classes (possibly infinite) 
in £ of size n. Note that for given «£NU {Ho}, the value hs(n) can be computed effectively: one 
can define in FO + 3°° the set of all <n ox -least elemental that belong to an equivalence class of 
size n. 

Given two automatic equivalence structures £\ — (D\\E\) and £2 = (D 2 ;E 2 ), deciding if 
£ 1 = £ 2 amounts to checking if he 1 — hg 2 . Therefore, the isomorphism problem for automatic 
equivalence structures is in II®. 

For the 77° lower bound, we use a reduction from Hilbert's 10 th problem: Given a polynomial 
p(x%, . . . , Xk) G Z[xi, . . . , Xk], decide whether the equation p(xi, . . . , Xk) = has a solution in N+ 
(for technical reasons, it is useful to exclude in solutions). This problem is well-known to be 
undecidable, see e.g. [T7]. In fact, Matiyasevich constructed from a given (index of a) recursively 
enumerable set X C N+ a polynomial p(x±, . . . , Xk) G Z[xi, . . . , Xk] such that for all n G N+: n £ X 
if and only if 3^2, ■ • ■ ,Uk £ N+ : p(n, j/2, ■ • ■ , Uk) = 0. Hence, the following set is TJj'-complete: 

{(pi{x),p 2 (x)) e N[x u . . .,x k } 2 I Vc G : Pl (z) ? p 2 (c)}. 

For a symbol a, let denote the alphabet 

£Z = {a,«} k \ {(<>,...,«)} 

and let Cj denote the i th component of a € For e = (ei, . . . , e^) G write a e for the word 

a ei ® a e2 ® ■ ■ • <g> a ek . 

For a language i, we write Ofc(L) for the language 

{m ® u 2 ® ■ ■ • <g> Mfc I ui, . . . , Ufc G L}. 

Lemma 2. There exists an algorithm that, given a non-zero polynomial p(x) G N[x] m k variables, 
constructs a non- deterministic automaton A[p(x)] on the alphabet with I(A[p(x)]) = <S>k(ci + ) 
such that for all c G N'j:.' ^4[p(x)] has exactly p(c) accepting runs on input a c . 

Proof. The automaton A[p(x)] is build by induction on the construction of the polynomial p, the 
base case is provided by the polynomials 1 and xi. 

Let A\\\ be a deterministic automaton accepting (E>fe(a + ). Next, suppose p(x±, . . . , Xk) = x\ for 
some i G {1, . . . , fc}. Let S = {qi, q 2 }, I = {qi} and F = {(72}- Define A as 

A = {( qi ,o-, qj ) I j G {l,2},crG-Eg,(T < =o}U{(gia 1 (7 > ga) | cr G E%}. 

When the automaton «4[p(3;)] = (S, I, A, F) runs on an input word a c , it has exactly Cj many 
times the chance to move from state q\ to the final state q 2 ■ Therefore there are exactly c, = p(c) 
many accepting runs on a°. 

2 <iiox denotes the length-lexicographical order on words. 
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Let pi(x) and p 2 (x) be polynomials in N[x]. Assume as inductive hypothesis that there are two 
automata .A[pi(3;)] and .A^Cc)] such that for i G {1, 2} the number of accepting runs of ,A[pi(:r)] 
on of equals pi (c) . 

For p(x) — pi(x) + p2(x), set = _4[pi(z)] W A[p2(x)]. Then, the number of accepting 

runs of _4[p(:z;)] on a c is pi(c) + p2{c). 

For p(x) = pi(x) -p 2 (x), let = _4[pi(3;)] x ^l[p2(3;)]- Then, the number of accepting runs 

of _4[p(x)] on a c is pi(c) ■ p 2 {c). □ 

Let A = (S, I, A, F) be a non-deterministic finite automaton with alphabet S. We define an 
automaton Run^ = (5, /, A' , F) with alphabet A and 

A ' = {(Pi (P,a,q),q) \ {p,a,q) G A}. 

Let 7r : A* — > S* be the projection morphism with 7r(p, a, q) = a. The following lemma is imme- 
diate from the definition. 

Lemma 3. For u G A + we have: u G L(Run^) if and only if u forms an accepting run of A on 
7r(u) (which in particular implies n(u) G L(A)). 

This lemma implies that for all words w G S + , 7r~ 1 (u;)nL(Run./i)| equals the number of accepting 
runs of A on w. Note that this does not hold for w — e. 

Consider a non-zero polynomial p(x) € N[x±, . . . , Xf.]. Let the automaton A — A[p(x)] satisfy 
the properties guaranteed by Lemma [2] and let Run^ be as defined above. Define an automatic 
equivalence structure £ (p) whose domain is L(Run^) \ {e}. Moreover, two words u,v £ L(Rnn^) \ 
{s} are equivalent if and only if tt{u) — n(v). By definition and Lemma [3J a natural number 
y G N+ belongs to Img + (p) if and only if there exists a word u £ L(A) with precisely y accepting 
runs, if and only if £{p) contains an equivalence class of size y. 

It is well known that the function C:NxN^N with 

C(x, y) = (x + yf + 3x + y (1) 

is injective (C(x,y)/2 defines a pairing function, see e.g. jl]). In the following, let ^Good denote 
the countably infinite equivalence structure with 



^GoodM 



oo if n e {C(y, z) | y, z G N+,y ^ z] 
otherwise. 



Proposition 4. The set of automatic presentations P with <S(P) = fcood is hard for FI®. 

Proof. For non-zero polynomials pi(x) , p 2 (x) G N[sgj., . . . , Xk], define the following three (non-zero) 
polynomials from N[xi, . . . , Xk] (with fc > 2): 

Sl(W) — C(pi(x),p 2 (x)), S 2 (x) = C(X1 + X2,Xl), S3(x) = C(xi,xi + x 2 ). 

Let £"(5i), £"(52), and £(83) be the automatic equivalence structures corresponding to these poly- 
nomials according to the above definition. Finally, let £ be the disjoint union of No many copies 
of these three equivalence structures. 

If pi(c) = p 2 (c) for some c G N+, then there is y G N+ such that C(y, y) G Img + (5i). Therefore 
in £ there is an equivalence class of size C(y, y) and no such equivalence class exists in £ Good- Hence 

£ ¥ £Good- 

Conversely, suppose that pi(c) ^ P2(c) for all c G N+. For all y,z G N+, £ contains an 
equivalence class of size C(y, z) if and only if C(y, z) belongs to Img + (5i) Ulmg + (52) Ulmg + (53), 
if and only if y ^ z, if and only if £cood contains an equivalence class of size C(y, z). Therefore, for 
any s G N+ , £ contains an equivalence class of size s if and only if fcood contains an equivalence 
class of size s. Hence £ = £g o&- 

In summary, we have reduced the Tl^-hard problem 

{(pi(x),p 2 (x)) G N[xi, . . .,x k } 2 I k > 2, Vc G : Pl (c) ? p 2 (c)} 
to the set of automatic presentations of ^Good- Hence the proposition is proved. □ 
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Theorem 5. The isomorphism problem for automatic equivalence structures is IF^-complete. 

Proof. At the beginning of this section, we already argued that the isomorphism problem is in TT\; 
hardness follows immediately from Proposition [4j since £cood is necessarily automatic. □ 

4 Automatic Trees 

A tree is a structure T = (V; <), where < is a partial order with a least element, called the root, 
and such that for every x G V, the order < restricted to the set {y \ y < x} of ancestors of x is a 
finite linear order. The level of a node x G V is \{y \ y < x}\ G N. The height of T is the supremum 
of the levels of all nodes in V; it may be infinite, but this paper deals with trees of finite height 
only. One may also view a tree as a directed graph (V, E), where there is an edge (u, v) G E if and 
only if u is the largest element in {x \ x < v}. The edge relation E is FO-dcfinable in (V; <). In 
this paper, we assume the partial order definition for trees, but will quite often refer to them as 
graphs for convenience. We use T n to denote the class of automatic trees with height at most n. 
Let n be fixed. Then the tree order < is FO-definable in T and this holds even uniformly for all 
trees from T n . Moreover, it is decidable whether a given automatic graph belongs to T n (since the 
class of trees of height n can be axiomatized in first-order logic) . 

As a corollary to Proposition |4l we get immediately that the isomorphism problem for auto- 
matic trees of height at most 2 is undecidable: 

Corollary 6. There exists an automatic tree Tcood of height 2 such that the set of automatic 
presentations P with <S(P) = Tcood is Tl^-hard. Hence, the isomorphism problem for the class Ti 
of automatic trees of height at most 2 is II® -hard. 

Proof. Let £ = (V; =) be an automatic equivalence structure. Now build the tree T(£) as follows: 

— the set of nodes is V U {r} U {au \ u G V, u is <n ox -minimal in [u] = } where r and a are two 
new letters 

— r is the root, its children are the words starting with a, and the children of au are the words 
from [u] = . 

Then it is clear that T(£) is a tree of height at most 2 and that an automatic presentation for 
T{£) can be computed from one for £. Furthermore, £ = £cood if arid only if T{£) = T(fcood)- 
Hence, indeed, the statement follows from Proposition |4j □ 

The hardness statement of Theorem [TBI below is a generalization of this corollary to all the classes 
T n for n > 2. But first, we prove an upper bound for the isomorphism problem for T n - 

Proposition 7. The isomorphism problem for the class T n of automatic trees of height at most 
n is 

— decidable for n = 1 and 

— in Tl2 n _ 3 for all n > 2. 

Proof. We first show that T\ = T2 is decidable for automatic trees T\,T^ £ T\ of height at most 1: 
It suffices to compute the cardinality of T (i 6 {1, 2}) which is possible since the universes of T\ 
and T2 are regular languages. 

Now let n > 2 and consider T\,T% G Tn- Let T t = (Vi, Ei), w.l.o.g. ^[1^2= 0, and V = Vi UV 2 , 
E = Ei U i?2- For any node u in V, let T(u) denote the subtree (of either T\ or T2) rooted at 
u and let E{u) be the set of children of u. For k = n — 2, n — 3, . . . , 0, we will define inductively 
a 7T2 n _ 2 / i ,_3-predicate isOfc(ui, M2) for u\,U2 G V. This predicate expresses that T{u\) = T(u2) 
provided u\ and u-i belong to level at least k. The result will follow since T\ = T2 if and only if 
isoo(ri,r2) holds, where r a is the root of T a . 
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For k = n — 2, the trees T(u\) and T(u 2 ) have height at most 2 and we can define iso n _2(ui, u 2 ) 
as follows: 



Vk G NU {K } W > 1 



3o;i, • • • ,xt G -E'(ui) : f\ Xi ^ Xj A f\\E{Xi)\ = k 

l<i<j<£ i=l 

<S=^ 3yi, ...,we £?(u a ) : /\ Vi ^ Vj A f\ \E{ Vl )\ = k 

l<i<j<t i=l 



In other words: for every k G N U {No}, u\ and W2 have the same number of children with exactly 
k children. Since FO + 3°° is uniformly dccidable for automatic structures, this is indeed a -/re- 
sentence (note that 2n — 2k — 3 = 1 for k = n — 2). For < k < n — 2, we define isOfc(ui, u 2 ) 
inductively as follows: 



Vw G E(ux)UE(u 2 ) W > 1 



V 



3xi, ■ • ■ , x e G E(u%) : f\ Xi ^ Xj A f\ iso fc+ i(u, Xi) 
i<i<j<e 



i=l 

£ 



3y%,...,yt G E(u 2 ) : f\ Ui ^ Vj A f\ iso k+1 (v,yi) 

l<i<j<( i=l 



By quantifying over all v G E(u%) U £(1*2), we quantify over all isomorphism types of trees that 
occur as a subtree rooted at a child of ui or u 2 . For each of these isomorphism types r, we express 
that iti and U2 have the same number of children x with T(x) of type r. Since by induction, 
isOfc+i(w, Xi) and isOfc+i(u, t/j) are 7l2 n _ 2fc _ 5 -statcments, isOfc(iti, U2) is a il2 n _ 2fe _ 3 -statement. □ 

The rest of this section is devoted to proving that the isomorphism problem for the class T n of 
automatic trees of height at most n > 2 is also iT2„_ 3 -hard (and therefore complete). So let P n (xo) 
be a 772 n _ 3 -predicate. In the following lemma and its proof, all quantifiers with unspecified range 
run over N+. 

Lemma 8. For 2 < i < n, there are Il^^-predicates Pi(xo, x±, y\, x 2 , y 2 , ■ ■ ■ , x n _. j, y n -i) such 
that 

(i) Pi+i(x) is logically equivalent to Vx n _i3j/„_i : Pi(x, x n _j, y n -i) f or 2 <i <n and 
(H) ^Vn-i ■ ->Pi(x,x n -i,y n -i) implies Vx^ > x„_ 4 Vy n -i : -•Pi{x,x' n _ i ,y n -i), 



where x = (xq, x\, j/i, . ■ ■ , x T 



-1; Vn—i—l j 



Proof. The predicates Pi are constructed by induction, starting with i = n — 1 down to i = 2 
where the construction of Pi does not assume that (i) or (ii) hold true for Pj+i. 

So let 2 < i < n such that Pj_|_i(x) is a n^ i+1 - ) _ 3 -predic&te. Then there exists a 272i-3~P re dicate 
P(x,x n -i,y n -i) such that Pj_|_i(x) is logically equivalent to 

*^X n —i^y n ~i . P(x , X n — i , yn — z) ■ 

But this is logically equivalent to 

Vx n _i Vx n _i x n _i 3y n _i . P(x, x n _i , y n — i ) . (2) 

Let y>(x, x„_i) be 

Vx n _,j ^ x n —i 3y n —i . P(x, x n _i, yn—i) • 

Then for any x„_i G N, 

-i<£(x,x n _j) => Vx > x n _j : -199(3?, ac) . (3) 

Since Vx' r! _ i < x n ~i is a bounded quantifier, the formula if(x, x n _j) belongs to S 2i _ 2 ( see f° r 
example [23j p. 61]). Thus there is a i72i-3"P recuca t e Pi{x, x n-i,yn-i) such that 



ip(x,x n - 



(4) 
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Therefore ^ (and therefore Pi+i(x)) is logically equivalent to Vx n _i 3y n —i : Pi(x, x n —i, y n —i)- 
Moreover, 



Let us fix the predicates P, for the rest of Section @] By induction on 2 < i < n, we will construct 
the following trees: 

— test trees Ti G 71 for c G N 1 ^ 2 '" (which depend on Pi) and 

— trees U l K G % for k G N+ U {uj} (we assume the standard order on N+ U {w}). 

The idea is that T± = {7* if and only if /c = 1 + inf ({x„_j | Vy„_i G N + : -Pi(c, x n -%, y n -i)} U {w})- 
We will not prove this equivalence, but the following simpler consequences for any c G N + : 

(PI) Pi(c) holds if and only if T± S E#. 

(P2) Pi(c) does not hold if and only if T± = U l m for some m G N+. 

The first property is certainly sufficient for proving P^2 n _ 3 -hardness (with i = n), the second 
property and therefore the trees XJ l m for m < u> are used in the inductive step. We also need the 
following property for the construction. 

(P3) No leaf of any of the trees T± or U l K is a child of the root. 

In the following section, we will describe the trees T~ and U l K of height at most i and prove (PI) 
and (P2). Condition (P3) will be obvious from the construction. The subsequent section is then 
devoted to prove the effective automaticity of these trees. 

4.1 Construction of trees 

We start with a few definitions: A forest is a disjoint union of trees. Let Hi and H 2 be two forests. 
The forest H" is the disjoint union of countably many copies of H\. Formally, if H\ = (V, E), then 
H? = (V x N, E') with ((v, i), (w,j)) G E' if and only if (v, w) G E and i = j. We write H x - H 2 
for Hf = H% . Then Hi ~ H 2 if they are formed, up to isomorphism, by the same set of trees (i.e., 
any tree is isomorphic to some connected component of Hi if and only if it is isomorphic to some 
connected component of H 2 ). If If is a forest and r does not belong to the domain of H, then we 
denote with r o H the tree that results from adding r to H as new least element. 

4.1.1 Induction base: construction of and 17? For notational simplicity, we write k for 
1 + 2(n — 2). Hence, P 2 is a fc-ary predicate. By Matiyasevich's theorem, we find two non-zero 
polynomials pi{xi, . . . , xi), p 2 (xi, . . . , xg) G N[ac], I > k, such that for any c G N^: 



For two numbers m, n G N+, let T[m, n] denote the tree of height 1 with exactly C(m,n) leaves, 
where C is the injective polynomial function from ([!]). Then define the following forests: 




This shows (ii). 



□ 




H 2 




(+){T[m, n] | m, n G N+, m ^ n} 

/7 2 W |+J{T[pi(c,x) + ar/ + i,pa(c,5) + x i+ i] \ x G N^, x £+ i G N+} 
J7 2 W |+J{T[x, i] | a; G N+, x > «} for k G N+ U {uj} 
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T[pi(c,x) +xt+i, T[m,n] T[x,x] T[m,n\ 

p 2 (c,x) + x e +i] 

The tree Tj The tree U\ 

Fig. 1. The tree Tj and U 2 



Note that J 2 = H 2 . Moreover, the forests J 2 (k G N+ U {w}) are pairwise non-isomorphic, since 
C is injective. 

The trees T 2 and U 2 , resp., are obtained from H 2 and J 2 , resp., by taking countably many 
copies and adding a root: 

T# = ro(H|)« C/ K 2 =ro(J^, (5) 

see Figured! 

The following lemma (stating (PI) for the 7T°-predicate P 2 , i.e., for i = 2) can be proved in a 
similar way as Theorem [5] 

Lemma 9. For any c G NlL, we /lave: 

P 2 (c) Zio/ds if 2 ~ J 2 T? = U 2 . 

Proof. By ([5]), it suffices to show the first equivalence. So first assume P 2 (c) holds. We have to 
prove that the forests H 2 and J 2 = H 2 contain the same trees (up to isomorphism). Clearly, every 
tree from H 2 is contained in H 2 . For the other direction, let x G N + _/c and Xi+i G N+. Then the 
tree T[pi(c,x) + Xf+i,p 2 (c,x) + xi+i] occurs in H 2 . Since P 2 (c) holds, we have pi(c,x) ^ p 2 (c,x) 
and therefore pi(c, x) + xi+i 7^ p 2 (c, 2?) + x^+i. Hence this tree also occurs in H . 

Conversely suppose ilf ~ if 2 and let x G N+" fe . Then the tree T[pi(c,x) + l,p 2 (c,x) + 1] 
occurs in if 2 and therefore in _ff 2 . Hence pi(c, x) 7^ p 2 (c, x). Since x was chosen arbitrarily, this 
implies P 2 (c). □ 

Now consider the forest i?| once more. If it contains a tree of the form T[m,m] for some m 
(necessarily m > 2), then it contains all trees T[x,x] for x > m. Hence, -ffj ~ J 2 for some 
k G N+ U {w}, which implies T~ = f/ 2 for some K G N+ U {w}. Thus, with Lemma|H]we get: 

P 2 (c) does not hold ^> T- 2 ^ C/ 2 3m G N + : T2 = U 2 m 

Hence we proved the following lemma, which states (P2) for the TJ^-predicate P 2 , i.e., for i = 2. 

Lemma 10. For any c G N+, we have: 

P 2 (c) does not hold 3m G N+ : T| = . 

This finishes the construction of the trees and C/ 2 for k G N + U {w}, and the verification of 
properties (PI) and (P2). Clearly, also (P3) holds for and U 2 (all maximal paths have length 
2). 

4.1.2 Induction step: construction of T^"*" 1 and U^ 1 For notational simplicity, we write 
again k for I + 2(n — i — I) such that Pj+i is a fc-ary predicate and Pi a (fe + 2)-ary one. 

We now apply the induction hypothesis. For any c £ x, y G N+, k G N+ U {oj} let T^ xy and 
[/* be trees of height at most i such that: 
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r 



r 



nVie, y e N+ 



Va;, m G N + 



\V1 < a: < k 



2^. 



UL 



The tree Ti +1 



The tree U. 



i + i 



Fig. 2. The tree T± +1 and J7* +1 



— Pj(c, a;, y) holds if and only if Ti^ = U^. 

— Pi(c, x, y) does not hold if and only if T£ xy = for some m G N + . 

In a first step, we build the trees Tl and U' KX (x G N+) from T£ xy and resp., by adding x 
leaves as children of the root. This ensures 

Tcxy — Tcx'y' ^ s x x A T-£ Xy = T-£ X ,y, and (6) 
Tcxy — U KyX > n ^ a; x A = [7 K , (7) 

since, by property (P3), no leaf of any of the trees T£ xy or U l K is a child of the root. Next, we 
collect these trees into forests as follows: 

H l+1 =\+){U: n , x \x,meN + } , 

H i +1 = w+1 w y {T ^ | x,y G N+} , and 

J* +1 = W | 1 < x < k} for k G N+ U {w}. 

The trees T^ +1 and[/* +1 , resp., are then obtained from the forests iR and J* +1 , resp., by taking 
countably many copies and adding a root: 

T1+ 1 = r o {H^Y and C/^ +1 = r o {J l K +1 ) u , (8) 

see Figure[5J 

Note that the height of any of these trees is one more than the height of the forests defining 
them and therefore at most i + Since none of the connected components of the forests and 
J* +1 is a singleton, none of the trees in © has a leaf that is a child of the root and therefore (P3) 
holds. 

Lemma 11. For all c G we have 

P i+1 (c) holds ^ H* 1 ~ Jl +1 «=> T^ +1 = C/: +1 . 

Proof. Again, we only have to prove the first equivalence. 

First assume H^ +1 ~ J^ +1 and let x > 1 be arbitrary. We have to exhibit some y > 1 such that 
Pi(c, x, y) holds. Note that t/^ ^ belongs to J^ +1 and therefore to . Since x ^ J7,' n ^ for any 
m,x,x' G N+, this implies the existence of x',y' > 1 with T~ x , y , = U' u x . By ([7]), this is equivalent 
with x = x' and T± xy , = U^. Now the induction hypothesis implies that Pi(c, x,y') holds. Since 
x > 1 was chosen arbitrarily, we can deduce Pi+\(c). 

Conversely suppose P i+ i(c). Let T belong to Hl +1 . By the induction hypothesis, it is one of 
the trees U' KX for some x G N+, k G N + U {uj}. In any case, it also belongs to J^ +1 . Hence it 
remains to show that any tree of the form U' u x belongs to H^ +1 . So let x G N + . Then, by Pi + i(c), 
there exists y G N+ with Pi(c, x, y). By the induction hypothesis, we have T^ xy = and therefore 
TL —U^ jX (which belongs to H^ +1 by the very definition). □ 
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Lemma 12. For all c 6 there exists k 6 N + U {w} swc/i i/iai T^ +1 = C/* +1 . 

Proof. It suffices to prove that Hl +1 ~ J£ +1 for some k G N+ U {w}. Choose k as the smallest 
value in N+ U {oj} such that 

\fx > n\fy : -^Pi(c,x,y) 
holds. By property (ii) from Lemma [5] for Pi, we get 

VI < x < k 3y : Pi (c, x, y ) . 

By the induction hypothesis, we get 

\/x>K\fy: Ti xy ^ U' u x and VI < x < n3y : Tl xy = U' u<x . 

It follows that contains, apart from the trees in H t+1 = l+J{[/^ x \ x,m 6 N + }, exactly the 

trees from |+|{£/^ | 1 < x < k}. Hence, H l - +1 - J* +1 . □ 

Lemma fTTI and fT2l immediately imply: 

Lemma 13. For all c € we have 

P l+ i(c) does not hold 3m G N+ : T^ +1 = U^ 1 . 
In summary, we obtained the following: 

Proposition 14. Let n > 2 and fei &e a _ 3 -predicate. Then, for any c G N+, we have 

P(c) holds T r " = C/J, 1 . 

To infer the 7T2 n _ 3 -hardness of the isomorphism problem for T n from this proposition, it remains 
to be shown that the trees T™ and {/" are effectively automatic - this is the topic of the next 
section. 

4.2 Automaticity 

For constructing automatic presentations for the trees from the previous section, it is actually 
easier to work with dags {directed acyclic graphs). The height of a dag D is the length (number of 
edges) of a longest directed path in D. We only consider dags of finite height. A root of a dag is 
a node without incoming edges. A dag D — (V, E) can be unfolded into a forest unfold(£>) in the 
usual way: Nodes of unfold(D) are directed paths in D that cannot be extended to the left (i.e., 
the initial node of the path is a root) and there is an edge between a path p and a path p' if and 
only if p' extends p by one more node. For a node v G V of D, we define the tree unfold(-D, i>) 
as follows: First we restrict D to those nodes that are reachable from v and then we unfold the 
resulting dag. We need the following lemma. 

Lemma 15. From given k G N and an automatic dag D = (V, E) of height at most k, one can 
construct effectively an automatic presentation P with S{¥) = unfold(Z)). 

Proof. The universe for our automatic copy of unfold(D) is the set P of all convolutions v\ Cg> V2 <S> 
• ■ ■ ® D m , where v\ is a root and (vi,Vi+±) G E for all 1 < i < m. Since D has height at most k, 
we have m < k. Since the edge relation of D is automatic and since the set of all roots in D is 
first-order definable and hence regular, P is indeed a regular set. Moreover, the edge relation of 
unfold(£>) becomes clearly FA recognizable on P. □ 

For 2 < i < ri, let us consider the following forest: 

F l =|+){Ti | c G f^ +2( "- 4) } W | k G N + U M} . 

Technically, this section proves by induction over i the following statement: 
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Proposition 16. For 2 < i < n, there exists an automatic copy T % of F l and an isomorphism 
p : F l — > J 71 that maps 

1. the root of the tree T± to cf (for all c € N+ +2(n_i) J, 

2. the root of the tree to e, and 

3. the root of the tree to b m (for all m G N+). 

This will give the desired result since T™ is then isomorphic to the connected component of T n 
that contains the word a c (and similarly for U"). Note that this connected component is automatic 
by Theorem [T] since the forest T n has bounded height. Moreover, an automatic presentation for 
the connected component containing a c can be computed from c. 

By Lemma [T5l it suffices to construct an automatic dag T> % such that there is an isomorphism 
h : unfold(2> 1 ) — > T 1 that is the identity on the set of roots of T>\ 

4.2.1 Induction base: the automatic dag I? 2 Recall the definitions of Ef, a e , and <S>fc(£) 
from Section [3] 

Lemma 17. From I G N+, 91,92 6 N[xi, . . . ,xi], and a symbol a, one can compute an automatic 
forest of height 1 over an alphabet Sf ttl r such that 

— the roots are the words from (E)g(a + ), 

— the leaves are words from r + , and 

— the tree rooted at a e is isomorphic to T[qi(e), 0.2(e) ]. 

Proof. Set p(xx, . . . ,xj>) = C(qi(x±, x{), q 2 {xi, •••) x l)) an d recall the definition of the au- 
tomata A[p] and Run^[ p ] from Section [3J Recall also that we let ir be the projection with 
7f(p, a,q) = a for a transition (p, a, q) of A[p\. Then let 

L[qi,q 2 ] = ®e(a + ) U [ir -1 (®i{a + )) n L(Run A[p] )) and 
E[qi,q2] = {(u,v) | u G ®i(a + ),v G 7r _1 (u) n L(Run A[p] )} . 

Then L[qi,q2] is regular and E[qi,q 2 ] is FA recognizable, i.e., the pair (L[q±, 92]; E[q\, 92]) is an 
automatic graph. It is actually a forest of height 1, the words from ®i{a + ) form the roots, and 
the tree rooted at a e has precisely pie) leaves, i.e., it is isomorphic to T[qi(e), 92(e)]. □ 

From now on, we use the notations from Section ^. 1.11 Using Lemma [T7l we can compute automatic 
forests F\ and T2 over alphabets IJf, l ttl 7\ and S\ ttl i~2, respectively, such that 

(a) the roots of T\ are the words from (g)f + i(a + ), 

(b) the roots of F2 are the words from <E>2(b + ), 

(c) the leaves of Ti are words from F^ (i G {1,2}), 

(d) the tree rooted at a eee+1 is isomorphic to T[pi(e) + e^+i,pa(e) + e^+i] for e G N+, e^+i G N+, 

(e) the tree rooted at 6 ei62 is isomorphic to T\e\, e 2 ] for e\, C2 G N + . 

We can assume that the alphabets Ji, I2, and H\ are mutually disjoint. Let T — (Vj-,Ejr) 

be the disjoint union of T\ and Ti; it is effectively automatic. 
The universe of the automatic dag V 2 is the regular language 

® fc (a + )U6*U($*0V F ), 

where $ is a new symbol. We have the following edges: 

— For u,v G Vjr, $ m <E> u is connected to $" ® w if and only if m = n and (it, i>) G Sjr. This 
produces Ho many copies of J 7 . 

— a7 is connected to any word from $* ® ({a E * | x G N+~ fc+1 } U {b eiC2 | ei 7^ e 2 }). By point (d) 
and (e) above, this means that the tree unfold(X> 2 , a c ) has Ko many subtrees isomorphic to 
T[pi(cx) + xi + i, p 2 (cx) + x i+ i] for x G x e+ i G N+ and T[ei, e 2 ] for ei,e 2 G N+, ei ^ e 2 . 
Hence, unfold (I? 2 , a~) Z| . 



12 




T\px (c, x) + x e+1 
p 2 (c,x) + x t+1 



Vm, ei , e2 
ei # e 2 




unfold(2^,a c ) T 7 



T[ei,e a 
unfold(0 2 ,e) 17* 



Vn, ei , e2 
ei 5^ e 2 or 
ei — e 2 > m 




T[ei,e 2 ] 

unfold (V 2 ,b m ) ^ 17^ 



Fig. 3. Automatic presentation of T| and 



— e is connected to all words from $* ® {b eiC2 \ e\ ^ e 2 }. By (e) above, this means that the tree 
unfold(Z> 2 ,e) has N many subtrees isomorphic to T[ei,e 2 ] for ei,e2 G N+, ej ^ e 2 . Hence, 
unfold(D 2 ,e) *iU*. 

— b m (to € N + ) is connected to all words from $* <E> {b eie2 \ ei ^ e 2 or ei = e 2 > m}. By (e) 
above, this means that the tree unfold(P 2 , b m ) has K many subtrees isomorphic to T[ei,e 2 ] 
for all ei, e 2 G N+ with ei ^ e 2 or ei = e 2 > m. Hence, unfold (V 2 , b m ) = U^. 

Thus, unfold (P2) — F 2 an d the roots are as required in Proposition [111 see Figure[3] Moreover, it 
is clear that 2? 2 is automatic. 

4.2.2 Induction step: the automatic dag X> 1 + 1 Suppose V 1 — (V, E) is such that T l = 
unfold(2? i ) is as described in Proposition [TBI 

We use the notations from Section 14.1.21 We first build another automatic dag V , whose 
unfolding will comprise (copies of) all the trees U' KX (k G N + U {uj}, x G N+) and T^ xy (c G N+, 
x, y G N+). Recall that the set of roots of V 1 is ®fc+ 2 (a + ) Ui" C K The universe of V consists of 
the regular language 

(V\6*)U(tt+®6*)UJtf|t5, 
where jj, (fx, and fla are new symbols. We have the following edges in T>'\ 

— All edges from E except those with an initial node in 6* are present in V. 

— ap xy G V is connected to all words of the form jf^ -1 for c G N+, x, y G N+, and 1 < i < x. 
This ensures that the subtree rooted at a cxy gets x new leaves, which are children of the root. 
Hence unfold(X>', aF xy ) ^ Ti xy . 

— § x ® b m for x G N + and to G N is connected to (i) all nodes to which b m is connected in T> 1 
and to (ii) all nodes from f^ x _i for 1 < i < x. This ensures that unfold (D', f <g> 6 m ) ^ L/^ x 
in case to G N + and unfold (X>', jf <g> e) = U' u>x . 

In summary, 2?' is a dag, whose unfolding consists of (a copy of) U^ x rooted at ft 35 ® e, [/^ x 
(to G N+) rooted at f <g> b m , and T^ y rooted at a cxy . 

From the automatic dag £>', we now build in a final step the automatic dag T> 1+1 . This is very 
similar to the constructions of V 2 and V above. Let V be the universe of V . The universe of 
V l+1 is the regular language 

<g) fc (a+)U&*U($* ®V') • 

The edges are as follows: 

— For u, v G V, $ m ® u is connected to $" ® u if and only if to = n and (u, w) is an edge of V . 
This generates Ho many copies of V . 

— a c is connected to every word from $* eg) ({a cxy \ x,y G N+} U (jj + ® fe + )). Hence, the tree 
unfold(P' +1 , a c ) has Ko many subtrees isomorphic to T^ xy for x, t/ G N + and E/j^,,,. for x,m G 
N+. Thus, unfold(X> i+1 ,a~) = T^ +1 . 
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unfold(© l+1 ,a c ) 2j +1 
Fig. 4. Automatic presentation of 



e 




e (g> $ m ® if <g> (7* e ® $ m ® |t x ® t/* 

unfold(© l+1 ,e) ^ t/i +1 



Fig. 5. Automatic presentation of 

— e is connected to all words from $* <g> ® b*). Hence, the tree unfold(£>* +1 , e) has Ho many 
subtrees isomorphic to U' K X for all x G N+ and k G N+ U {u;}. Thus, unfold(2? 4+1 , e) = U^ +1 . 

- b m (in G N + ) is connected to all words from $*<8>((jj + ®b + )u{$ x ®e | 1 < x < m}). This means 
that the tree unfold(2? l+1 , b m ) has N many subtrees isomorphic to U' m x for all to, x G N + and 
E/^ ia . for all 1 < x < m. Hence, unfold(D i+1 , b m ) = 

See Figure HlEl and [5] for the overall construction. This finishes the proof of Proposition [T^l Hence 
we obtain: 

Theorem 18. 1. For any n > 2, the isomorphism problem for automatic trees of height at most 
n is H^n^- complete. 

2. The isomorphism problem for the class of automatic trees of finite height is recursively equiv- 
alent to FOTh(N;+, x). 

Proof. We first prove the first statement. Containment in -Z72„_ 3 was shown in Proposition \7\ For 
the hardness, let P n C be any il^^-predicate and let c G N+. Then, above, we constructed 
the automatic forest T n of height n. The trees T™ and ?7™ are first-order definable in F n since 
they are (isomorphic to) the trees rooted at a c and e, resp. Hence these two trees are automatic. 
By Proposition [TH they are isomorphic if and only if P n (c) holds. 

We now come to the second statement. Since the proof of Prop. [7] is uniform in the level n, 
we can compute from two automatic trees T\^Ti of finite height an arithmetical formula, which is 
true if and only if T\ = T^. For the other direction, one observes that the height of an automatic 
tree of finite height can be computed. Then the result follows from the first statement because of 
the uniformity of its proof. □ 

In fact, we proved a slightly stronger statement: For every n > 2, there exists a fixed ^2ri-3" 
complete set P^n-z ^ N+. If we apply our construction, we obtain a fixed automatic forest J 7 ™ 
of height n with the following properties: It is 772„_ 3 -complete to determine, whether for given 
c G N+, the tree rooted at a c in T n is isomorphic to the tree rooted at e in J 7 ™. 
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b m ® $" (g> f ® Ul b m ® $" <g> (t x ® !7£ 



unfold(-D l+1 ,6 m ) 
Fig. 6. Automatic presentation of E/^j" 1 

5 Recursive trees of finite height 

In this section, we briefly discuss the isomorphism problem for recursive (i.e., computable) trees 
of finite height. 

Theorem 19. For every n > 1, the isomorphism problem for recursive trees of height at most n 
is H\ n - complete. 

Proof. For the upper bound, let us first assume that n = 1. Two recursive trees Ti and T2 of 
height 1 are isomorphic if and only if: for every k > 0, there exist at least k nodes in T% if and 
only if there exist at least k nodes in T2. This is a il^-statement. For the inductive step, we can 
reuse the arguments from the proof of Proposition [7] 

For the lower bound, we first note that the isomorphism problem for recursive trees of height 1 
is il^ -complete. It is known that the problem whether a given recursively enumerable set is infinite 
is 7?2 -complete |19j . For a given deterministic Turing-machine M, we construct a recursive tree 
T(M) of height 1 as follows: the set of leaves of T(M) is the set of all accepting computations 
of M. We add a root to the tree and connect the root to all leaves. If L(M) is infinite, then 
T(M) is isomorphic to the height-1 tree with infinitely many leaves. If L(M) is finite, then there 
exists m € N such that T(M) is isomorphic to the height-1 tree with m leaves. We can use this 
construction as the base case for our construction in Section 14.1.21 This yields the lower bound 
for all n > 1. □ 



6 Automatic Linear Orders 

We use u> to denote the linear order (type of) (N; <) of the natural numbers and n to denote the 
finite linear order (type) of size n. Let I = (Dj; </) be a linear order and let C = {Li \ i g Di} 
be a class of linear orders, where Li = (Dj; <i) for i G Dj. The sum is the linear order 
({(x,i) I i € Di,x £ Di}; <) where for all i, j £ Di, x £ Di, and y £ Dj, 

(x, i) < (y, j) i<ij\/(i = jAx <i y) . 

We use Li + L 2 to denote J2i^i | * € 2}. We denote with Lj • L 2 the sum ^{L\ \ i £ L 2 } where 
L\ = L\ for every i £ L 2 . An interval of a linear order L = (D; <) is a subset 7 £ D such that 
x,y £ 7 and x < z < y imply z £ I. 

A well-known example of an automatic linear order is the lexicographic order <i ox on a regular 
language D. To define <i cx , we first need a fixed linear order < on the alphabet S of D. For 
w,w' £ D, we say that w is lexicographically less than w', denoted by w <\ cx w' , if either w is a 
proper prefix of w' or there exist x,y,z £ E* and <r, r £ £ such that u> = xay, w' = xtz and 
a < t. We write u> <i ox ui' if either u> = w' or to <i cx w' . For convenience, in this paper, we use 
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<ic X to denote the lexicographic order regardless of the corresponding alphabets and orders on the 
alphabets. The precise definition of <i ox in different occurrences will be clear from the context. 

This section is devoted to proving that the isomorphism problem on the class of automatic 
linear orders is at least as hard as FOTh(N; +, x). To this end, it suffices to prove (uniformly in 
n) I7°-hardness for every even n. The general plan for this is similar to the proof for trees of finite 
height: we use Hilbert's 10 th problem to handle TT^-predicates in several variables and an inductive 
construction of more complicated linear orders to handle quantifiers, i.e., to proceed from a LT^i-i" 
to a Zl^-predicate (and from a S^r t° a n^+i-predicate). 

So let n > 1 be even and let P n (xo) be a Z^-predicate. For every odd (even) number 1 < i < n, 
let Pi(xo, • ■ • , x n -i) be the 7T 4 °-predicate (^-predicate) such that Pi + i(xo, . . . , x n -i-i) is logically 
equivalent to Qx n -i : Pi(xo, ■ ■ ■ , x n -i) where Q = 3 if i is odd and Q = V if i is even. We fix these 
predicates for the rest of Section |B1 

By induction on 1 < i < n, we will construct from c G the following linear orders: 

— a test linear order Li, 

— a linear order K l , and 

— a set of linear orders M 1 such that M 1 = {M^ \ m G N+} and M l is the singleton {M 1 } if 
i>l. 

These linear orders will have the following properties: 
(PI) Pi(c) holds if and only if L l - = K\ 

(P2) Pi(c) does not hold if and only if LL = M for some M G M l . 

(P3) The linear order uj ■ i is not isomorphic to any interval of L^, K l , M where M G A4 l . 

In the rest of the section, we will inductively construct L^, K l , and A4 l and prove (PI), (P2), 
and (P3). The subsequent section is devoted to proving the effective automaticity of these linear 
orders. 

6.1 Construction of linear orders 

Our construction of linear orders is quite similar to the construction for trees from Section 14.11 
One of the main differences is that in the inductive step for trees, we went from a TT^-predicate 
directly to a 7T° +2 -predicate. Thereby the height of the trees only increased by one. This was 
crucial in order to get 772„_3-completeness for the isomorphism problem for automatic trees of 
height n > 2. For automatic linear orders, we split the construction into two inductive steps: in 
the first step, we go from a TT-'-predicate (i odd) to a -predicate, whereas in the second step, 
we go from a Sf +1 -predicate to a TT^-predicate. 

A key technique used in the construction is the shuffle sum of a class of linear orders. Let / be 
a countable set. A dense I-coloring of Q is a mapping c : Q — >• / such that for all x, y G Q with 
x < y and all i G / there exists x < z < y with c(z) = i. 

Definition 20. Let C = {Li \ i G /} be a set of linear orders with L countable and let c : Q — > / 
be a dense L-coloring o/Q. The shuffle sum of C, denoted Shuf (£), is the linear order X^gQ ■^ J c(x) ■ 

In the above definition, the isomorphism type of J2x£qLc(x) does not depend on the choice of the 
dense /-coloring c, see e.g. [20]. Hence Shuf(£) is indeed uniquely defined. 

In this section, we will consider classes L\ and £2 of linear orders that we consider as classes 
of isomorphism types. Therefore, we use the following abbreviations: 

— 11 L G £1" denotes that C\ contains a linear order isomorphic to L, 

— "£1 C £ 2 " denotes VXi G C\ 3L 2 G C 2 ■ L x = L 2 , and 

— "£1 = £2" abbreviates C\ C £ 2 Q £i- 
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6.1.1 Induction base: construction of Li, K 1 , and Recall from Section [3] that the 

polynomial function C(x, y) = [x + y) 2 + 3a; + y is injective. For m, n 2 G N+, let L[ni, 712] be the 
finite linear order of length C(ni,ri2). 

By applying Matiyasevich's theorem, we obtain two polynomials p\(x) , p2{x) G N\x] in £ vari- 
ables, I > n, such that for all c G N™, the TT^-predicate Pi(c) holds if and only if 

VfGN'-" : Pl (c,S)^p 2 (c,5) . 

Fix c G N" and to G N+. We define the following four classes of finite linear orders: 

£\{c) = {L[ Pl (c,x) + x e+1 ,p 2 (c,x) + x e+1 ] \xeN<+ r \x e+1 G N+} (9) 

£\{m) = {L[x + m,x + m) \ x G N+} (10) 

Cl = {L[x + y,x]\x,yeN+} (11) 

£l = {L[x,x + y}\x,y£N+} (12) 

The linear orders L~, K 1 , and are obtained by taking the shuffle sums of unions of the above 
classes of linear orders: 

L\ = Shuf(£ x (c) U C\ U C\), K 1 = Shuf(4 U C\), M„\ = Shuf (£|(rn) U C\ U 4). 

The next lemma is needed to prove (PI) and (P2) for the LL^-predicate Pi. 

Lemma 21. Suppose £\ and £ 2 are two countable sets of finite linear orders. Then 

£1 = £ 2 <^=> Shuf(£ x ) s Shuf(£ 2 ) 

and no interval o/Shuf(£i) is isomorphic to oj. 

Proof. If £1 = £2, then it is clear that Shuf(£i) = Shuf(£ 2 ). Conversely, suppose there exists an 
isomorphism / from Shuf(£i) to Shuf(£ 2 ). We prove below that £\ = £2. By symmetry we only 
need to prove £\ C £2. 

Note that for i G {1,2}, Shuf(£j) is obtained by replacing each q G Q with some linear order 
L q (whose type is) contained in £1. For every q G Q, if f(L q ) contains elements from L 2 and L 2 , 
for some p < p' , then f{Lt) is infinite which is impossible. Therefore / maps L q into L 2 for some 
p G Q. Using the same argument with / replaced by / , we can also prove that f^ 1 maps L 2 into 
L q . Hence L q = L^. This means that for all L G £%, there is L' G £2 such that L = L'. Therefore 

A c £ 2 . 

If Xx < X2 < ■ ■ ■ in Shuf(£i), then there are p < p' in Q and fc < £ in N+ such that Xk G L* 
and a;^ G Lp,. But then the interval [xk,xi\ is infinite. Hence no interval in Shuf(£i) is isomorphic 

tO LU. □ 

The next lemma states (PI) and (P2) for i = 1: 

Lemma 22. For any c G N" , we Ziawe: 

(%) Pi(c) /10/ds Ll^K 1 . 

(2) Pi(c) does not hold 3m G N+ : Li = M} n . 

Proof. For (1), we have 

P l (c) <=> G : pi (c, z) ^ p 2 (c, x) 



Vx £ N' + ", x e+ i G N+ :pi(c,x)+ x e+ i ^p 2 {c,x) + x e+ 



> \/x G G N+ : L[p x (c,z) +x e+1 ,p 2 {c,x) + x i+ i] G £3 U £4 

> £ 1 (c)U£^U£l = £ju£l 
^Li-tf 1 . 
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For (2), we get 

-Pi (c) ^=> 3x G f^T" : Pl (c, 2) = p 2 (c, 5) 

3meN + :i[m + l ) m+l]g4(c) 
<J=^ 3m G N+ : (Vfc > m : L[k,k] G C\(c) A VI < k < m : L[k,k] C\{c)) 
3mGN+ :£|(c)U£3U£l = 4(m)u4u£l 

Lcmma l21l -, r i , ,i 

<S=X— 1 3m G N+ : £± = . 

□ 

Since Li, if 1 , and are shuffle sums, they satisfy (P3) by Lemma |2~T1 This finishes the con- 
struction for the base case. 



6.1.2 First induction step: from Pi to Pi+i for i odd Suppose i > 1 is an odd number. 
For notational simplicity, we write k for n — i. Thus, Pi+i is a fc-ary predicate and Pi is a (fc + 1)- 
ary one. For all c G N+, Pj+i(c) is logically equivalent to 3x : Pi(c,x). Applying the inductive 
hypothesis, for any c G and x G N+, we obtain linear orders L^, K l , and the set Af' such 
that 

— Pi(c, x) holds if and only if L\ x = if 1 , 

— Pj(c, x) does not hold if and only if L\ x = M for some M G Af l , and 

— w • i is not isomorphic to any interval of L^ x , K l , or M where M G M l . 

Fix c G NiL. We define the following classes of linear orders: 
£'i +1 (c) = {wi + Li. | x E N+}, £ 2 +1 = {wi + Af | M G M 1 }, £g +1 = {w -i + K'}. (13) 

The linear orders L^ +1 , -FP +1 , and AP +1 are defined as shuffle sums of unions of the above classes 
of linear orders: 

Li +1 = Shuf(£l +1 (c) U K l+1 = Shuf(4 +1 U 4 +1 ), M' i+1 = Shuf(4 +1 ). (14) 

Recall that the set Ai l is a singleton for i > 1, consisting of M\ The next lemma can be proved 
similarly as Lemma [2~T] 

Lemma 23. Suppose C\ and £2 are two countable classes of linear orders such that each L G 
£i U £2 is isomorphic to a linear order of the form uj ■ i + K , where uj ■ i is not isomorphic to any 
interval of K . Then 

£1 = £2 <^=> Shuf(A) = Shuf(£ 2 ) ■ 

If Shuf(£i) contains an interval isomorphic to UJ ■ (i + 1), then there is a linear order K with 
uj- (i + l)+K G £1. 

Proof. If L\ = £2, then it is clear that Shuf(£i) = Shuf(£2). Conversely, suppose / is an isomor- 
phism from Shuf(£i) to Shuf(£ 2 ). We prove that £1 = £2. By symmetry we only need to prove 
that £1 C £ 2 . 

Say Cj = {Lj^ s \ s G N} for j G {1,2}. Intuitively, for j G {1,2}, Shuf(£,) can be viewed 
as obtained by replacing each q G Q with a linear order L(j,q) = Lj C ^, where c is a dense 
N-coloring. Fix q G Q. Suppose f(L(l,q)) contains elements in L(2,p) and L(2,p') for p,p' G Q 
with p < p' . Then in f(L(l,q)) there arc infinitely many disjoint intervals that are isomorphic to 
uj ■ i, while in L(l, q) there is exactly one such interval, a contradiction. Therefore / maps L(l, q) 
into L(2,p) for some p G Q. 

If f(L(l,q)) £ L(2,p), then f~ 1 (L(2,p)) contains an element x ^ L(l,q). The argument 
from the previous paragraph with / replaced by / _1 again leads to a contradiction. Therefore 
f(L(l,q)) — L(2,p). This means that for all L G £1, there is V G £2 such that L = V and the 
lemma is proved. 
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Let I = u) ■ (i + 1) be some interval in Shuf(£i). First suppose there are p < r in Q such that J 
intersects L(l,p) and L(l, r). But then L(l, q) C I for all p < g < r, implying that (Q, <) embeds 
into / = u) • (i + 1) which is impossible. Hence there is some q G Q with / C L(l,g) G £1. Then 
there is a linear order K such that L(l, q) = u-i + K. Since w • i (let alone uj ■ (i + 1)) is no interval 
in K, the interval / has to intersect the initial segment w • i of L(l, q). But then u has to be an 
initial segment of K, i.e., L(l, q) = cj ■ (i + 1) + K' for some linear order K' . □ 

Now notice that w ■ (i + 1) is not isomorphic to any interval of Li + , AT 2+1 , or M l+1 (each of the 
orders L\ x , K l , and M G M l is a shuffle sum and therefore does not start with to). Hence (P3) 
holds for i + 1. Furthermore, the following holds: 

Pi+i (c) <S=^ G N+ : P i+1 (c, a;) 

<^> 3x G N+ : Li a S JP 

^ 4 +1 c £l +1 (c) 

<S=^ (c) U 4 +1 = U 4 +1 



Lci nma|23l 7-i-(-l 



i+1 



L^ L = K 

nP l+1 (c) <^> G N+ : ^P m (c, s) 

<^> \/x G N + 3Af G Af : 24 = M 

4 +1 (c)u4 +1 =4 +1 

Lomma[23] ^ M i+1 



We have shown (PI) and (P2) for i + 1 in case i is odd. 

6.1.3 Second induction step: from Pi to Pi_|_i for i even Let i > 1 be even and consider 
the iT?, i -predicate P;+i. Again, we write fc for rt — i. For all c G N+, Pj_|_i(c) is logically equivalent 
to Vx : Pi(c, x). Since i is even, we must have i > 2. Therefore the set A4 l is a singleton, consisting 
of the linear order M\ 

Fix c G N5_. Define the classes of linear orders £i +1 (c), an d 4 +1 using the same definition 

as m (fT3l) . The linear orders and AP+ 1 are defined as follows: 



= Shuf(£l +1 (c) U 4 +1 ), iT +1 = Shuf(4 +1 ), M i+1 = Shuf(4 +1 U 4 +1 ). 

Again, u) ■ (i+ 1) is not isomorphic to any interval of Lg + , K l+1 , or M l+1 . Hence (P3) holds for 
i + Furthermore, the following holds: 

P+i(c) <^> VxeN + :Pi{c,x) 

\/x G N+ : L\ x = K i 
^ £l +1 (c)u4+ 1 = 4+ 1 

Lomma[23] ^ 

-■P i+ i (c) <^> 3s G N+ : -P(c, x) 
3x G N + : Li K = M l 

<=>• c\ +1 (c) u 4 +1 = 4 +1 u 4 +1 

Lemmam — M' i+1 

We have shown (PI) and (P2) for i + 1 in case i is even. This finishes the construction and proof 
for (PI), (P2), and (P3) in the inductive step. 

6.2 Automaticity 

To construct automatic presentations of the linear orders from the previous section, we first fix 
some notations. For c — (ci, . . . , c&) G N5_ and a symbol a, we re-define a c as the word 

a Cl tt---a Cfc ttG{a,tt}*. 



19 



Recall that Lemma [5] described a way to represent a polynomial p(x) G N[x] in fc variables using 
the number of accepting runs of an automaton .4[p(x)]. The next lemma re-states Lemma [5] with 
respect to the new definition of a c . 

Lemma 24. From a polynomial p(x) G m fc variables, one can effectively construct a non- 
deterministic automaton A[p(x)] on alphabet {a, ft} such that L(A[p(x)]) — (a + ft) fc and for all 
c G : ^4[p(x)] has exactly p(c) accepting runs on input a c . 

Proof. We use the same proof as for Lemma [2] The only difference is when the polynomial 
p(xi, . . . , Xk) is of the form n for some i G {1, ...,&}. In this case, the automaton ^4[a;j] is 
(5, /, A, F) where S = {qo, qi, ■ ■ ■ qk, q'i], I = {<Zo}j F = {<?fc} and the transition relation A is 

^ = {(<7j-i,tt,<7j) I 1 < 3 < k d ^ i} U {(<7,a,q) | g G S 1 } U {(ft-i, a, aj), (g£, ft, #)}. 

It is easy to sec that L(^4[xj]) = (a + ft) fe and ^4[xj] has exactly q accepting runs on input a c where 
c G N+. □ 

From now on, when referring to ,A[p(x)], we always assume it is defined in the sense of Lemma [24] 
(as opposed to Lemma [2]) . Let A be a non-deterministic finite automaton over the alphabet S 
and let A be the transition relation of A. Recall the definition of the automaton Run^i and the 
projection morphism tt : A* — > S* from Section[3] Then, Run^ is an automaton over the alphabet 
A. Assume that a lexicographic order <i cx has been defined on each of S* and A*. Define the 
automatic linear order C on L(Run_4) such that for all w, w' G L(Run^): 

w C w' <^==> tt(w) <i cx tt(w') V (ir(w) — tt(w ) A w <i cx w'). (15) 

Let Si be the alphabet {ft, $i, . . . , $, 0, 1, a, b%, 02, ^3}- Fix the order < on Si such that 

$<$!<•■•< < < ft < a < h < 62 < 63 < 1. (16) 

For any automaton A over Si, fix an arbitrary order on the transition relation A of A. Let <i ox 
be the lexicographic orders on S* and A* defined with respect to these orders, respectively. From 
now on, we will always let C be the linear order as defined in (|15p with respect to <i cx - For a 
regular language L C S* let first(L) = {a G S | 3w G S* : aw G L}. For u G S*, we use to 
denote the language uS* R L. Technically, in this section we prove by induction on i the following 
statement: 

Proposition 25. We can compute automata A 1 over Si such that: 

(1) L(A 1 ) = ((a+ft)" U 6+ft U 6 2 ft)$i? for some regular language R C £+ 

(2) lfi>\, then L(A l ) = ((a + ft)"" 4+1 U 61ft U 6 2 ft)$i? for some regular language R C 
C3) L| S ( 7 r- 1 ( J L(^)[a% n L(Run^); E) jfor c G N"~ i+1 

ft) ^ (^- 1 (i(^l 1 )[6' 1 "ft]) n L(Run^); Q /or m G N+ 
(5) AP - (Tr-^i^Ot&ift]) n L(Run^); Q /or i > 1 

(e; x i = (^(L^MMD n L(Run^); Q 

Moreover, in (1) and (2) we have first(i?) C {0, 1}. 



6.2.1 Effective automaticity of shuffle sums This section shows that we can construct an 
automatic presentation of the shuffle sum of a class of automatic linear orders that are presented 
in some specific way. For a regular language D over an alphabet, which does neither contain nor 
1, let a(D) = ({0, 1}*LD)+. 

Lemma 26. Let A be an automaton such that L{A) = ED$F for regular languages E, D C 
{a, 61, 02, 03, ft}* and F C S* (for some 1 < i < n). We can effectively compute an automaton 
a (A, E) such that L(a(A, E)) = E$a(D)$F and for all u G E: 

(tt" 1 (w$ct(D)$ j F 1 ) n L(Run a(A>E) ); Q ^ Shuf({(7r -1 (m;$.F) n L(Run A ); Q\v€ D}). 
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Proof. Suppose A = (S, I, A, Sf). Let r — {a, bi, &2, 03, jj}. We first define the automaton 

A' = (S x {1, 2, loop}, I x {1}, A', S f x {2}). 

The transition function A' of A' is defined as follows: 

A' = {({q,l),a,(p,l)) I («,a,p) e A,ae T}U 
{((g,l),$,(g,loop)) \qeS}U 
{((<?, loop), a, (g, loop)) a efU{0,l}}U 
{((g,loop),l,(g,2)) |geS}U 
{((<j,2),a,(p,2)) | (g,a,p) G 4} 

Intuitively, A! consists of two copies of A whose state spaces are S x {1} and S x {2}. The 
automaton A' runs by starting simulating A on the first copy. When the first $ is read, it stops 
the simulation. For this, the automaton stores the state q by moving to the "looping state" (q, loop). 
The automaton will stay in (q, loop) unless 1 is read, in which case, it may "guess" that it reads the 
last 1 before the second $ in the input. If so, it goes out of (q, loop) and continues the simulation in 
the second copy of A and accepts the input word if the run stops at a final state. If the guess was 
not correct and there is another 1 before the second $ in the input, then the run will necessarily 
reject. 

It is easy to see that for all ui, u 2 G -T*, v G (.TU{0, 1})*1 and U3 G F, the number of accepting 
runs of A' on u\%vu2$U3 is the same as the number of accepting runs of A on uiit 2 $ii3, i.e., 

|Z(Ruru') n7r~ 1 (ui$i;'ti2$M3)| = \L(Run A ) n 7r -1 (tiitt2$U3)|. (17) 

Let 

a(A,E) = E$a{D)$F n A'. 

Note that L(cr(A, E)) = E%a{D)%F. Also, for any u x G E, v G ({0, 1}*LD)*{0, 1}*1, u 2 G D, and 
M3 G F, the number of accepting runs of a(A,E) on u\%vu2$U3 equals the number of accepting 
runs of A' on ui$vu2$U3, which is, by (|17[) . equal to the number of accepting runs of A on u\U2$U3. 
Hence, we have 

\L(K\m a ^E)) H 7r _1 (ui$vu2$U3)\ = \L(R\m A ) n 7r^ 1 (uiw 2 $U3)|. (18) 
We prove the following claim. 

Claim 1. For all m G E, v G ({0, 1}*1£>)*{0, 1}*1 and u 2 G D, 

(■K- 1 (u 1 $vu 2 $F) n L{Run a{AtE) ); C) S (7T — ^ (iti U2 ) n ^(Run^); □). (19) 

For u G F, let L(u) = (7r -1 (uiU2$u)n.L(Run,4); □). Note that this is a finite linear order. Consider 
the linear order (F; <i cx ). By definition of C, 

(7r- 1 (wiu 2 $i ;l )ni(Run^);C) = ^ L{u). 

U£F 

By (fT8|) . L(it) = (it~ 1 (ui$vu2$u) fl X(Run (T (^ i £;)); C). By definition of C again, 

(ir~ 1 (ui$vu2$F) n L(Run (T (^4. B )); C) = (n -1 (ui$vv,2$u) D L(Run CT (_A ; £j); C) 

= {n- 1 { Ul u 2 $F) n L(Run^); □). 

This proves Claim 1. 
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Let c : a(D) — >• D be the function such that 

V.t G ({0, 1}*1L>)*{0, 1}*1 \/ueD: c{xu) = u. 

Claim 2. (a(D); <\ ex ) = (Q; <) and the function c is a dense £>-coloring of (cr(-D); <i ex )- 
First, for every w = xlu G cr(-D) with x G ({0, 1}*1£>){0, 1}* and 11 e fl, we have 

xOlu <ic X w <icx xllu. 

Hence, (cr(-D); <icx) does not have a smallest or largest element. It remains to show that the linear 
order (a(D); <i ex ) is densely D-colored by c (this implies that (<j(D)\ <i ex ) is dense and hence, by 
Cantor's theorem, isomorphic to (Q; <)). Consider two words w 1 ,i«2 G <?(D) such that W\ <i cx w 2 - 
There are two cases. 

Case 1. wi — xay, W2 = xfiz for x, y, z G [T U {0, 1})* and a, j3 e r U {0, 1} such that a < /?. In 
this case, for all u G -D, we have 

»i <i cx wilw <i ox W2 and u>ilu G cr(D). 

Case 2. W2 = w\x for some 1 e (fU {0, 1}) + . Since W2 G °~{D), we have 2 ^ 0*. Say a: = J ay for 
some j > 0, a ^ and y G (T U {0, 1})*. We must have a G {1, a, 61, &2, &3, tt}- Since every symbol 
from this set is larger than (see (fl6|) ) we must have a > 0. Then for all u G -D, we have 

w% <iox i^iO^ 1 !?/ <i C x W2 and u?i(F + lu G cr(D). 

Hence (a(D)\ <i ox ) is indeed densely colored by c. This proves Claim 2. 

Since $ is the minimum in the order < on Si, for any u G -E, v, v' G cr(D) and iu, u/ G -F, we have 

« <lcx => 7i$77$W <l ox U$v'$w'. 

Therefore, 

( 7 r- 1 (w$a( J D)$F)n J L(Run CT(AJ5) );C) S ^ (tt-^uSwSF) n L(Run CT(AE) ); C) 

■uG<t(D) 
Claim 1 x — , . 

- H {ir- 1 (uc(v)$F)nL(Rnn A );Q 

Claim 2 

^ Shuf({(7r- 1 (ww$ J F) n i(Run^); C) | v G £>}). 

□ 

6.2.2 Base case: automatic presentations for Z/i, if 1 , and Af^ Recall the notations 
from Section [6.1.1l In the following, if is a regular language and A is a finite non-deterministic 
automaton then we denote by DA a finite automaton that results from the disjoint union of a 
deterministic automaton Ad for D and the automaton A by adding all transitions (q, a,p) where: 
(i) q is a state of Ad, (ii) there is a transition (q, a, in Ad, where q' is a final state of Ad, and 
(iii) p is an initial state of A. Clearly, L(DA) = DL(A). We will only apply this definition in case 
the product DL(A) is unambiguous. This means that if u G DL(A) then there exists a unique 
factorization u = U1U2 with u\ G D and U2 G £(^4). The following lemma is easy to prove: 

Lemma 27. Let A be a finite non- deterministic automaton and let D be a regular language such 
that the product DL(A) is unambiguous. Letu\ G D andu2 G L(A). Then, the number of accepting 
runs of DA on U1U2 equals the number of accepting runs of A on U2- 

Lemma 28. From two given polynomials q\(x),q2(x) G N\x] in k variables, one can effectively 
construct an automaton A[qi, (72] over the alphabet {a, $} such that 
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- L(A[q u q 2 ]) = (a+f) k %and 

- For allcGN k + , (tt- 1 ^) n L(Run^ [gii , a] ); Q = L[ qi (c),q 2 (c)}. 

Proof. We construct _4[qi,g 2 ] by taking a copy of A[C(q\(x), q 2 {x))} (see Lemma [24)) . adding a 
new state q$ and transitions (g/, S, q$) for each accepting state <?/ in _4[C((7i(3;), 52(2;))] and making 
q$ the only accepting state of A[qi, q 2 ]- Note that for any c G N+, the number of accepting runs of 
A[q\, q 2 ] on a c $ is the same as the number of accepting runs of A[C(q\(x), q 2 (x))] on a c , which is 
equal to C(qi(c),q 2 (c)). Hence, (7r _1 (a c $) n L(Run_4[ gi 92 j); C) forms a copy of L[qi(c),q 2 (c)} and 
the lemma is proved. □ 

By Lemma [28l we can construct automata Ai = A[pi(x) + xi + i,p 2 (x) + xe+i], where x G N+, over 
the alphabet {a, (),$}, A 2 = A[xi +x 2 ,xi + x 2 ] over the alphabet {61, (),$}, A3 = A[xi + x 2 ,xi] 
over the alphabet {6 2 , )J, $} and Ai — A[xi, x\ + x 2 ] over the alphabet {&3, ft, $} such that: 

Vc e N e + Vc £+1 G N+ : (^(a^S) n L(Run^); Q £* L[ Pl (c) + c m ,p 2 (c) + c* +x ] (20) 



Vei,e 2 G N+ 
Vei,e 2 G N+ 
Vei, e 2 E N + 



(7r- i (6rK 2 S$)nL(Run^ 2 
( 7 r- 1 (6^ 1 tJ^ 2 tJ$)n J L(Run^ 3 

^—1 /l,ei 44l, e 2 4. 



Q = L[ ei +e 2 , ei +e 2 } (21) 
E) = £[ei + e 2 ,ei] (22) 
C)SL[ei,ei+e 2 ] (23) 



( 7 r^ 1 (6^«6^tt$)nL(Run A 
Define the following automata: 

i^Aatffl+l)"^^)), ^ = i 2 a(fc+p 3 aA4)), .A3 = b 2 $(A 3 ttA 4 ). 

Note that 

L(A°) = (a+r (V») £ ~" +1 U (^tt) 2 U (^ttfk 



L(^) = 6+ft^+ttU(fe+tt) 2 U(6+tt) 2 )$ 

^-^((Wu^+tt) 2 ^. 



Hence, applying Lemma [26] (with F = {e}), we can effectively construct automata Al (j G 
{1, 2, 3}) as follows: 

A|=^,(a + t)"), Al = a(A° 2 ,bU), Al = a(Aib 2 $). 

For all c e N" we get: 

_ Lemma 1261 

(^- 1 (L(^)[a c ])nL(Run^);C) = 

Shuf({(7r- 1 (a^$) n L(Run^o); C) | v G (a+^-" +1 U (fe+tt) 2 U (^B) 2 }) = 

Shuf ({(^(a"®) n L(Run^o); E) | e G N^" +1 } U 

{(vr- 1 ^ 1 ^ 2 ^) ni(Run^o); C) | ei ,e 2 G N+} U 

. _ Lemma 1271 

{ (tt- 1 (a c 6^ tt&? tt$) n i(Run^o ) ; C) I ei , e 2 e N+ }) S 

Shuf({(7r- 1 (a 7;s $) n ^(RuiuJ; E) | e G N e + n+1 } U 

{(^^^^WnifRun^)^) I ei ,e 2 GN+}U 

{(7r-^W$)n£(Rmu 4 );Q I e lf e 2 GN+}) m = m 
Shuf ({L[p x (c,e) + e £+ i,p 2 (c,e) + e<+i] | e 6 N l + n ,e e+1 G N+} U 

{i[ei +e 2 ,e x ] | ei,e 2 G N+} U {L[e i ,e 1 + e 2 ] | ei,e 2 G N+}) 
Shuf(£}(c) UC\ UC\) Si 4 
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Similar calculations yield: 



Vm g N+ : (^(LiADn) n L(Run^); Q - Shuf(£|(m) U U 4) £ 
(^(L^MD n L(Run^); Q S Shuf(4 U 4) <* if 1 

Let .4 1 = A\ W ^ W ^3- is eas y to sce tnat ^M 1 ) = (( a+ tD™ u &i tt u M)$-R for some regular 
language i? C I?^ with first(i?) C {0, 1}. Hence A 1 satisfies the statement in Proposition |2~51 

6.2.3 First inductive step: automatic presentations for L^ 1 , K 1 ^ 1 , for i odd 

Let i > 1 be an odd number. Recall the notations from Section T6.1.2I We write k for n — i. By 



applying the inductive assumption, we obtain an automaton A 1 such that L(A l ) = ((a + $) k+1 U 
/3ft U &2|t)$-R f° r some regular language R C Z 1 * where /3 = if i = 1, and (3 = bi otherwise. 
Furthermore, first(i?) C {0, 1} and the following hold for A 1 : 

Vc G N* +1 : 4 - (7r _1 (a~$i?) n L(Run A i); E) (24) 

X 1 S {(7r _1 («tt$i2) n L(Run^); E) I « G /?} (25) 

iT = (7r _1 (i> 2 pi?!) n L(Run^); E) (26) 

For any 1 < j < n, let Sj t = $f U • • • U It is easy to see that 

(S,;<te*)^w-j. (27) 

Define the automata B|, £>|, and S3 as 

= ((a+(J) fc+1 $i? n A*) tt) (a+|t) fe+l $Si, (28) 

= (/3(t$ii n „4 4 ) W (SjJSSi, (29) 

B l 3 = {b 2 $$R n „4 ,; ) W battSft. (30) 

By (HU), ([2"i |) -(j2"? l> . and the fact that first (i?) C {0, 1}, we have 

Vc G N^ +1 : ( 7 r- 1 (a"$(5i U i?)) n L(Run B i); C)Su.| + 4, (31) 

{(7r~ 1 (utlS(S' l U i?)) n L(Run e , ); C) | tt G {w • i + M | M G M 1 }, (32) 

(tt -1 (62^(5, U R)) n L(Run e ,); C) S u ■ i + iT. (33) 



Now construct the automata C^Q, and C\ as follows: 

C\ = B\ W (a+H) fc ^, = bipl, Cj = 6att(B5 w 

We have 

L(Cj) = (a+tt) fc (a + BU/3tt)$(5,Ui?), 

L(C^6 1 »(5,Uii), 

L(C|) = & 2 tJ(/3|J U &2|t)$(5i U R). 

Hence, we can apply Lemma |2"51 to C\, C|, and C| (with i* 1 = Si U i?) to define the following 
automata: 

A\ +1 = a(Cj,(a+tt) fc ), 4+ 1 =^1,6x0), ^ 3 +1 = <r(Cj, M)- 
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For all c E we get: 

. , - _ Lemma |26I 

( L (A\ +1 )[a c ]) n HRno-Ap 1 )': E) = 

Shuf({(7r- 1 (aS$(S' l U i?)) n L(Run C j); Q | » € a+tf U 08}) = 
Shuf({(7r- 1 (a Ee $(5 l U R)) n L(Run c «); C) I e G N+} U 

_ Lemma 1271 

{( 7 r- 1 (a c utt$(S , l Ui?))nL(Run cl );C) | u G j8}) = 
Shuf({(7r- 1 (a^ e $(5 l U fl)) n L(Run B j); Q I e G N+} U 

{(tt-^uIJS^ U R)) n L(Run B i); Q | u G /?}) ^U 33 

GD, CE3 . , , 

Shuf ({a; • i + £I e | e G N+} U {w • i + M | M G .M 1 }) Si 
Similarly, we can show: 

( 7 r- 1 (L(^ +1 )[& 1 tt]) n LCRun^+i); Q = Shuf({w • i + M | M G M*}) ^ M i+1 , 

U 



( 7 r- 1 (L(^ +1 )[6 2 tt]) n LfTtun^+i); Q = Shuf({w • i + M | M e M 1 } U {co • i + A'*}) £ iT +1 . 



Let „4 4+1 = A\ +1 W „4 2 +1 w ^3 +1 - lt is eas y to see that L{A l+1 ) = ((a+B) fc U foxtt U 6 2 tt)SJ2' for 
some regular language JZ' C -Ei} with first(JZ') C {0,1}. Hence A l+1 satisfies the statement in 
Proposition l25l 

6.2.4 Second inductive step: automatic presentations for 2/1 . , i^ 1 ^ 1 , Af l + 1 for i 
even Using the same technique, we can construct automatic presentations for (c G 

M t+1 , and A l+1 in case i is even. We first define the automata B\, B\, and B\ as in ([28 ]l -([30" ]) . 
with (3 = b\ this time. Then we construct 

C\ = B\ U (a+ij) fe £|, = W Bj), Cl = ftajjflj. 

We define the following automata by applying Lemma [26l 

„41 +1 = a(a(a + tf) fe ), ^ 2 +1 = <7(^,M). A +1 = <r(ClM). 

By Lemma [26l it is easy to check the following: 

Vc G N k + : {■k- 1 {L{A\ +1 )[o s ]) n L(Run^i+i); Q = Shuf({o; ■ i + CX CX \ x G N+} U ■ i + K 1 }) 

~ ri+l 

— ) 

(7r- 1 (L(^ 2 +1 )[6 1 )J]) n ^(Run^i+i); C) ~ Shuf({w • i + M 1 } U • i + A 1 }) 

= Af 4+1 , 

(Tr- 1 (L{Al +1 )[b 2 $}) n L(Run^ + i); C) = Shuf({w • i + K 1 }) 

= K l+l . 

Let A l+1 = A\ +1 a »4 2 +1 W It is easy to see that L(A l+1 ) C ((a+ti) fc U U for 

some regular language R 1 C with ffrst(iZ') C {0, 1}. Hence _4 4+1 satisfies the statement 

in Proposition [25] This finishes the construction in the inductive step and hence the proof of 
Proposition [25] Hence we obtain: 

Theorem 29. The isomorphism problem for the class of automatic linear orders is at least as 
hard as FOTh(N; +, x). 
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In [15], it is shown that every linear order has finite FC-rank. We do not define the FC-rank of 
a linear order in general, see e.g. [T5J. A linear order (L, <) has FC-rank 1, if after identifying 
all x,y G L such that the interval [x,y] is finite, one obtains a dense ordering or the singleton 
linear order. The result of [151 mentioned above suggests that the isomorphism problem might be 
simpler for linear orders of low FC-rank. We now prove that this is not the case: 

Corollary 30. The isomorphism problem for automatic linear orders of FC-rank 1 is at least as 
hard as FOTh(N;+, x). 

Proof. We provide a reduction from the isomorphism problem for automatic linear orders (of 
arbitrary rank): if (L, <) is an automatic linear order, then so is (K, <) = ((— 1, 0] + [1, 2)) • (L, <) 
(this linear order is obtained from L by replacing each point with a copy of the rational numbers 
in (-1,0] U [1,2)). Then (K , <) has FC-rank 1: Only the copies of and 1 will be identified, 
and the resulting order is isomorphic to (Q, <). Moreover, (L, <) is isomorphic to the set of all 
x G K satisfying 3z > x My : (x<y<z^y = z). Hence (L, <) = (L' , <') if and only if 
((-1, 0] + [1, 2)) • (L, <) = ((-1, 0] + [1, 2)) • (£', <'), which completes the reduction. □ 

7 Conclusion 

This paper looks at the isomorphism problem of some typical classes of automatic structures. 
Such classes include equivalence structures, successor trees of height at most n G N, and linear 
orders. In particular, we demonstrate, respectively, Tl^-completeness and 7J2 n _ 3 -completeness for 
the isomorphism problem of the first two classes. The uniformity in our proof shows that the 
isomorphism problem of automatic trees of finite height is recursively equivalent to FOTh(N; +, x). 
Similarly, we prove that the isomorphism problem of automatic linear orders is at least as hard 
as FOTh(N;+, x). The same technique is also used to proved that the isomorphism problem of 
recursive trees of height at most n is il^-complete. 

We conclude with an application of Theorems [18] and [29] The following corollary shows that 
although automatic structures look simple (especially for automatic trees), there may be no "sim- 
ple" isomorphism between two automatic copies of the same structure. An isomorphism / between 
two automatic structures with domains L\ and L 2 , respectively, is a ^-isomorphism, if the set 
{(x,f(x)) I x G Li} belongs to E%. 

Corollary 31. For any k G N, there exist two isomorphic automatic trees of finite height (and 
two automatic linear orders) without any -isomorphism. 

Proof. Let T\ = (D\\ E\) and T2 = (D 2 ; E 2 ) be two automatic trees. Let P\{x, y), P 2 (x, y), ■ ■ ■ be 
an effective enumeration of all binary ^-predicates. This means that from given e > 1 we can 
effectively compute a description (e.g. a Z^-formula over (N;+, x)) of the predicate P e (x,y). We 
define the statement iso(Ti,T2,fc) as follows: 

3e Vxi,X2 G Di 3yi,y2 € D 2 : P e (xi,yi) A P e (x 2l y 2 ) A 

(xi = x 2 o y x = y 2 ) A ((xi, x 2 ) G E\ (yi, y 2 ) G E 2 ) 

A My G D 2 3x e Di : P e (x,y) 

Since P e is a I7°-predicate, this is a Z , ° +2 -statement, which expresses the existence of a E®- 
isomorphism from T\ to T 2 . 

By Theorem [TH] there is a natural number n such that the isomorphism problem on the class 
T n of automatic trees of height at most n is l^fc-i-3-hard. If for all Ti, T 2 G 7~ n with T± = T 2 there 
exists a ^"-isomorphism from T\ to T 2 , then the isomorphism problem on T n reduces to checking 
existence of a ^-isomorphism, which is in E® +2 by the above consideration. Hence, there must 
be T\,T 2 G T n with Ti = T2 but there is no ^-isomorphism between them. 

The corollary for linear orders can be proved in the same way, where in the definition of 
iso(Ti, T2, fc) we replace {x\,x 2 ) G E\ O (yi,y 2 ) G E 2 with x\ <i x 2 y\ < 2 y 2 , where <i and 
<2 are the linear orders of T\ and T2, respectively. □ 
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